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1. INTRODUCTION 

In the past decade, several online communities have grown and are important in everyday life. 
Today, it acts as a platform for the communication and interaction process among communities both in 
individual life and in the commerce environment. Thus, the number of data used and utilized has been 
growing significantly both from users and providers. Therefore, this large amount of data is a worthy source 
of study since allowing the study of people’s behaviour and their interactions. Therefore, happiness is 
subjective depending on who you ask the question to. Some people are satisfied with only little fulfilment of 
their needs while some might require larger support from others. Recently, there are advanced progress in 
applying computing technologies [R1-R8] that have significant progress in artificial intelligence [1]-[8]. 
Meanwhile, personal cognition researchers have turned to the study of social media and digital devices to ask 
whether a person’s digital effects can reveal aspects of their identity. At the same time, advances in big data 
analysis have shown that computer algorithms can predict the traits of individuals from their digital effects 
[9]. Everyone has a different source of happiness and degree of fulfilment to achieve that happiness. On a 
global scale, where countries are involved to represent the people, larger factors come into play that could 
have rippling effect worldwide on the happiness level experienced. This research has objective to investigate 
the connection link between happiness and macro level factors namely incomes, life expectancy, social 
support, generosity, corruption and freedom. Utopia is seen as a country in which everyone is happy. A 
utopia can be built without having a complete aggregation of promising statistics of all happiness factors. 
There could be a trade-off elsewhere that could be identified as a non-factor to the construction. Therefore, 
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the problem of determining the right blend of attributes of happiness is evident, thus examining from a bird’s 
eye view will give a clearer understanding of what it takes to reach a utopian state. In general, people often 
spend an average of 20% of their time socializing with others through various channels, especially through 
face-to-face conversations [10]. 


2. MATERIAL AND METHOD 

Aristotle’s wisdom explained that more than anything else, men and women seek happiness. 
Happiness can be defined as a sense of well-being, joy, showing pleasure or contentment. It also can be 
understood as fortunate and convenient. Commonly found, someone will feel happy when she or he is 
successful, safe or lucky. Our world community is adamant to foresight the future population to enjoy 
sustainable happiness. The World Happiness Report [11] has presented the list of global countries based on 
the happiness index from total of 156 countries. They were ranked based on several indicators that point out 
on the level of happiness of average citizen for thinking about themselves. This report explained the 
evolution of happiness that changes over time due to several event or condition. The focus emphasized that 
the technologies, environment, culture, clash, even the government policies has been believed to be the 
drivers for changing the world happiness level. In recent years, many countries in the world seeks to build a 
happy country. For example, New Zealand will spend almost NZ$2 billion for health services which include 
mental health, fully funded hospital and children well-being. Several European countries such Finland, 
Norway and Denmark have been pronounced top three (3) happiest country in the world in 2018 and the 
United Nation, with the sustainable development goal also leading to ensure the world we are living is 
sustainable and people are happy and have quality of life. 

In the view of research, the happiness can be defined as the collaboration of how satisfied person 
with the condition or the quality of his/her life for such as to find out the true meaning of working hard and 
the feelings over daily routine in term of interaction with the others. Interestingly, one research affirms that 
happiness is not sent to external events but in how we interpret them [12]. It is actually a condition that each 
person must prepare, develop and personally. People who learn to control internal experiences that will 
determine the quality of their life, which is the closest thing to us to be happy. Therefore, happiness is in 
inner harmony, while people with inner harmony who led energetic lives are open to diverse experiences, 
continue to learn until the day of their death, and have strong ties and commitments to others and the 
environment in which they live. They enjoy what they do, even if it is boring or difficult, which are almost 
never bored and can calmly accept whatever comes their way. 

Many scales and questionnaires have been developed to measure happiness. Happiness can be 
measured with satisfaction with life scale [13]. In 1988, it have been introduced positive and negative affect 
schedule (PANAS) Scale with measure happiness which consists of two mood scales, measuring positive and 
negative influence [14]. When used as a psychometric scale, PANAS can show the relationship between both 
of the influences with statistics and personality traits. Another scale that directly measure people happiness is 
subjective happiness scale (known as general happiness scale) [15]. The happiness is measured in scale of 7 
from not a very happy to the opposite. Other questions cover comparisons with peers, general people 
happiness and characterization that describe the subject. In year 2002, Argyle and Hill have produced an 
advanced instrument or indicators namely the Oxford Happiness Questionnaire (OHQ), which was the 
modification from the Oxford Happiness Inventory (OHI) [16]. In summary, OHI consist of 29 items that 
include the four optional selection of the conditions [17]. The OHQ consists of similar items as in OHI, 
which measured in six-point Likert scale. Currently, there are many challenges have been studied related to 
predict the happiness specially to adapt to their behavior according to the network behavior such as an 
assessment to find out the user generated text with the social influence model that describe interdependency 
between user emotions and sentiments [18]. Meanwhile, the approach is supposed to not only present a high 
degree of control but also offer external validity in term of exchanging practical information in the process of 
maintaining social relationships [19]. 

The distinctive feature of communication channel has been used become the determinant of 
happiness through various spectrums. Thus, the business transactions involving the electronic commerce and 
the investment has become the main ingredients for other daily routine in term of online exchange, online 
games, streaming videos and music. Actually, user often categorizing those activities based on the platform 
or the supported tools. Therefore, the communication via social media such as Skype, Facebook, Whatsapp 
or Weechat provides less work value in comparison to other online tasks, which mobile services are decided 
upon its distinctive feature led to the greater customer happiness [20]. Given the ultimate question on the 
reason of certain people become happier compare to the others, or others on the motivation of some people 
have the tendency to be happy even though they are in the negative condition, Actually, what is worth noting 
is that self-happiness has been seen as a common concept in psychological study and a place of interest to 
search for. It should be seen as an important part of human life related to human beings’ essential goal. 
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Therefore, self-happiness is a relatively consistent structure over time, which in every condition has an 
impact on how people feel, think, recall and see certain life events in either negative or positive direction 
[21], [22]. The relationship between facial expressions and emotions has been extensively studied, leading to 
primary theories that indicate brief nonverbal cues reveal much about individuals’ emotional states, which 
mostly Ekman’s facial action coding system (FACS) has been used widely to characterized and code the 
smile intensity or level. The dataset used for the project is the World Happiness Report of 2019, which has 
been produced by software-defined satellite networking (SDSN) and extracted by PromptCloud’s custom 
web crawling solution. It was later found in the Kaggle website which is a public data platform for the data 
science and machine learning towards community to work together in exploring and building models while 
engaging in competition to solve challenges. The 2019 World Happiness Report is a landmark survey of the 
ranking list of global happiness countries by tagging with several macro level factors contributed to the 
happiness score of several countries. In performing exploratory analysis of the dataset, Kaggle’s Python is 
used since it provides a well-equipped workspace to carry out programming with ease on the dataset. 


3. RESULTS AND DISCUSSION 

Currently, some studies suggest that in all modes of communication except Skype, participants 
experience more happiness when there is laughter, both tangible and symbolic, which is determined critically 
to the interaction conducted rather than the duration it have taken [19]. Actually, the tradition has suggested 
that happiness is achieved when every domain and subdomain of fulfilment are satisfied, which indicated that 
personal health in term of leisure and work activities should be greater compare to expectation [23], [24]. The 
more interaction occurred among human or the greater time provided to have exercise those are strongly 
linked to happiness [25]. It should take a note that the related factors contributing to combating depression 
such as self-reported happiness, social support and job participation, which provide the defense mechanism 
and capability to face in direct way with the negative life events without becoming depressed. Meanwhile, 
physiological factors also influence susceptibility to depression, and many studies show that a strong 
association between sleep disturbances, subsequent depression, and physical health is closely related to 
depression and happiness [26]-[30]. For the purpose of this study, in (1) the exploratory analysis begins by 
importing the required libraries and defining the functions required to plot the data. The dataset is then read 
into a variable to be stored for further processing, which can be seen in (2). 


import numpy as np 

import pandas as pd 

import matplotlib.pyplot as plt 

import seaborn as sns 

import os (1) 


Data = pd.read_csv(*../world-happiness-report-2019.csv’ ) (2) 


Predictive models use patterns found in transactional and historical data to identify risks and 
opportunities. The model captures the relationships between many factors to allow an assessment of the risk 
or potential associated with a specific set of conditions, guiding decision making for candidate transactions. 
The general overview of the data based on the columns is described: 

— Country (region): Name of the country. 

— Ladder: Measure of life satisfaction. 

— SD of Ladder: Standard deviation of the ladder. 

— Positive affect: Measure of positive emotion. 

— Negative affect: Measure of negative emotion. 

— Social Support: The extent provided to the accumulation of the happiness score. 

— Freedom: The extent provided to the accumulation of the happiness score. 

— Corruption: The extent provided the accumulation of the happiness score. 

— Generosity: The extent provided to the accumulation of the happiness score. 

— Log of gross domestic product (GDP) per capita: The extent provided to the accumulation of the 
happiness score. 

— Healthy life expectancy: The extent provided to the accumulation of the happiness score. 

The happiest countries in the world are listed in Table 1 while the opposite are listed in Table 2. On 
the other hand, renaming certain attribute names is conducted for simplicity as shown in (3) while (4) 
presented the dimensions of the dataset obtained are 156 rows and 9 columns. Futhermore, in Figure 1, 
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predictive analysis is carried out to predict at a more detailed level, namely, to produce a prediction score or 


so-called probability for each element of the organization through (5). This sets it apart from foresight, while, 
the number of rows missing from the value is obtained. 


Table 1. The list of happiest countries 


Country Ladder SD of (+) (-) Social Freedom Corruption Generocity Log of Healthy life 
(region) Ladder affect affect support GDP per expectancy 
capita 
0 Finland 1 4 41.0 10.0 2.0 5.0 4.0 47.0 22.0 27.0 
1 Denmark 2 13 24.0 26.0 4.0 6.0 3.0 22.0 14.0 23.0 
2 Norway 3 8 16.0 29.0 3.0 3.0 8.0 11.0 7.0 12.0 
3 Iceland 4 9 3.0 3.0 1.0 7.0 45.0 3.0 15.0 13.0 
4 Netherlands 5 1 12.0 25.0 15.0 19.0 12.0 7.0 12.0 18.0 
5 Switzerland 6 11 44.0 21.0 13.0 11.0 7.0 16.0 8.0 4.0 
6 Sweden 7 18 34.0 8.0 25.0 10.0 6.0 17.0 13.0 17.0 
7 New Zealand 8 15 22.0 12.0 5.0 8.0 5.0 8.0 26.0 14.0 
8 Canada 9 23 18.0 49.0 20.0 9.0 11.0 14.0 19.0 8.0 
9 Austria 10 10 64.0 24.0 31.0 26.0 19.0 25.0 16.0 15.0 
Table 2. The list of unhappiest countries 
Country Ladder SD of (+) (-) Social Freedom Corruption —_Generocity Log of Healthy life 
(region) Ladder affect affect support GDP per expectancy 
capita 
146 Haiti 147 111 142.0 119.0 46.0 152.0 48.0 20.0 138.0 125.0 
147 Botswana 148 125 87.0 65.0 05.0 60.0 54.0 150.0 66.0 113.0 
148 Syria 149 137 155.0 155.0 54.0 153.0 38.0 69.0 NaN 128.0 
149 Malawi 150 132 110.0 110.0 50.0 65.0 64.0 109.0 147.0 119.0 
150 Yemen 151 85 75.0 75.0 00.0 147.0 83.0 155.0 141.0 124.0 
151 Rwanda 152 63 102.0 102.0 44.0 21.0 2.0 90.0 132.0 103.0 
152 Tanzania 153 122 50.0 50.0 31.0 78.0 34.0 49.0 125.0 118.0 
153 Afghanistan 154 25 133.0 133.0 310 155.0 136.0 137.0 134.0 139.0 
154 Central 155 117 153.0 153.0 55.0 133.0 122.0 113.0 152.0 150.0 
African 
Republic 
155 South Sudan 156 140 152.0 152.0 48.0 154.0 61.0 85.0 140.0 143.0 
data = data.drop([°Ladder*, “SD of Ladder’], axis=1) (3) 
data = data.rename (columns={ Country (region) *: Country’, 


“Positive affect’:*Pos*, “Negative affect’: °Neg’, 
“Log of GDP\nper capita’: °GDP*, 
“Healthy life\nexpectancy :*°Life expectancy’ }) (4) 


data.shape(156, 9) (5) 


ata. info data.isnull().sum() 


<class ‘pandas.core.frame.DataFrame'> 
RangeIndex: 156 entries, @ to 155 Count iy, 


Data columns (total 9 columns): Pos 


Neg 


Country 156 non-null object 
155 non-null float64 


155 non-null float64 Social support 
Social support 155 non-null float64 Freedom 


Freedom 155 non-null float64 
Corruption 148 non-null float64 
Generosity 155 non-null float64 Generosity 

GDP 152 non-null float64 GDP 

Life expectancy 158 non-null float64 = 

dtypes: float64(8), object(1) Life expectancy 


memory usage: 11.6+ KB dtype: int64 


Corruption 


Oh OH 2s 2 = © 


Figure 1. The results from predictive analysis 


In the process, there is an indication for a missing value within the dataset and the data needs to be 
cleaned. Since the data set is rather small, (6) has been used for deleting rows with missing values which is 
the best option for this case. Now the dataset is clean with all rows complete although the size is reduced by 
the values in each column. To dissect the dataset further, a numerical description of each attribute is included. 
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By implementing (7), the information reveals very similar values for each attribute, which signals that a 
ranking system is used to order the countries, thus making the nature of the dataset filled with ordinal 
attributes. Before proceeding to the visualization part, (8) has been used to introduce class labels to group 
countries according to their happiness rating. This divides the dataset into 3 equal size grouping to be made a 
class label. Thus, the dataset is introduced a new attribute called class which has the 3 class labels; 0 for the 
top 46 countries in happiness ranking, | for the middle 46 countries in happiness ranking, 2 for the bottom 46 
countries in happiness ranking. The happiest countries in each grouping are shown in Table 3. Meanwhile, 


the line graph of each of the main attributes with respect to happiness ratings can be seen in Table 4 through 
(9) for classifying process. 


data = data[~data.isnull().any(axis=1) ] 


data.shape 
(140, 9) (6) 
data.describe () (7) 


Table 3. The list of data set for happiness factors 


Pos Neg Social support Freedom Corruption __ Generosity GDP Life expectancy 

count 140.00 140.00 140.00 140.00 140.00 140.00 140.00 140.00 
mean 78.24 79.15 77.50 78.82 75.70 78.85 79.01 75.47 

std 44.33 44.50 45.81 45.10 42.65 44.72 43.35 43.97 
min 1.00 2.00 1.00 1.00 1.00 1.00 2.00 1.00 
25% 40.75 40.75 36.75 39.75 39.75 40.75 41.75 36.75 
50% 78.50 78.50 77.50 79.50 76.50 79.50 78.50 77.50 
75% 116.25 117.25 118.25 118.25 112.25 116.25 117.25 113.25 
max 154.00 154.00 155.00 155.00 148.00 155.00 152.00 150.00 
count 140.00 140.00 140.00 140.00 140.00 140.00 140.00 140.00 


data[°Class*] = 0 


data.iloc[size:2*size][°Class*] = 1 
data.iloc[2*size:][°Class°] = 2 (8) 
data.iloc[[0, size, 2*size], :] (9) 


Table 4. The list of happiest countries after classifying process 


Country Pos Neg Social support Freedom Corruption Generosity GDP Life Class 
expectancy 
0 Finland 41.0 10.0 2.0 5.0 4.0 47.0 22.0 27.0 0 
53 South Korea 101.0 45.0 91.0 144.0 100.0 40.0 27.0 9.0 1 
104 Laos 5.0 112.0 120.0 22.0 27.0 34.0 102.0 112.0 2 


The shape of the graphs confirms the results described as can be seen in appendix of mapping 
attributes with the social support, GDP and life expectancy attributes are having an almost linear line, which 
can be seen in Figures 2 and 3 in Appendix. Meanwhile, the generosity, freedom and corruption attributes are 
unable to approach an appropriate straight line. Maps of the world has function to show the exact location 
where the happiest and unhappiest countries are; plus how the other factors might influencce. For this 
visualization can be seen in appendix of world map visualization, the darker red a country is, the less happy 
or satisfied they are. The lighter colour countries are the happier ones. White colour and black colour 
represent unknown values and missing values respectively as can be seen in Figures 4 and 5 in Appendix. 
The map shows that the North America continent, European continent, and Australia continent are the 
happiest while the Asian continent and the South American continent are in the middle with the African 
continent all the way down when it comes to happiness. This statistic is aligned with the fact that the three 
major groups can be formed as created based on our previously introduced class labels. At first glance, the 
groups of continents show similar characteristics. The triad of the North America continent, European 
continent, and Australia continent contains modern and developed nations. The duo of the Asian continent 
and the South American continent contains fast growing and developing economies. The African continent 
contains mostly of underdeveloped and poor countries. The results can be used to reduce risk by making 
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sensible and effective decisions when validating a person or company based on available data. On the other 
hand, it also aims to detect fraud to track changes in behavior patterns on a network or site by detecting 
anomalies that may indicate such behavior. Meanwhile, it also contributes to a competitive advantage by 
providing valuable information, such as customer data, to have an advantage over other competitors. At some 
level, it can present production efficiency by forecasting inventory, production rates, and potential failures. 

The map for the social support, GDP and life expectancy attributes are very similar to the happiness 
ranking attribute’s one while generosity, freedom and corruption attributes have variation with distinct 
similarities; which further confirms the results elaborated in appendix of attribute mapping. Interestingly, the 
idea that subjective well-being such as happiness index can be captured in research somehow have been 
challenged by economists and politicians. Presumebly, they claimed that people’s assessment of well-being 
should be identified various aspect such as spiritual and ecological, which predominantly those factors also 
can be influenced, for example, by the value of their country’s education system and the social status. Thus, 
the way of subjectivity in measuring happiness index should be consistent and discussed further to allow 
fairer metric to be used respectively. 

The results presented that to make a utopian state, the relevant party should achieve high GDP per 
capita, longer life expectancy and greater social support as well as a corruption-free, freedom-laden and 
generous state. In other hand, telecommunications service providers often use strategies extensively to 
identify potential customers and convince them to stay based on past information and past behavior to predict 
the future [31]-[35]. Meanwhile, physical hardware standardization, fault tolerance, and load balancing are 
key factors that improve the performance of cloud computing that occurs through virtual machine migration 
(VM). Thus, energy savings can be achieved by energy prediction using various machine learning techniques 
[36]-[39]. Cardiorespiratory fitness studies, on the other hand, often use regression analysis to predict heart 
and respiratory conditions and future outcomes [40]. In addition, data identification technology is an effective 
technique for guiding analysts and extracting new knowledge. Therefore, new problematic OLAP emerged, 
linking its principles with data extraction methods to enhance online analysis and not be limited to simple 
research and simple data visualization [41]. Interestingly, common algorithms such as K-nearest neighbors, 
naive bayesian, decision trees, and support vector machines are not consider best in term of effectiveness and 
accuracy by comparing linear regression, additive regression, and decision stumps to show the level of 
effectiveness and accuracy to generate a linear prediction [42]-[44]. 


4. CONCLUSION 

In conclusion, the study reaffirms our presumption that happiness is not a take-all-to-get-it object. 
Actually, it requires the right and proper concoction of factors to obtain it accordingly that is determine 
implicitly by the degree of freedom, corruption, generosity, social support, GDP and life expectancy from the 
country. Meanwhile, on the ethical side of things, the dataset does not contain any personal information of 
the participants who filled the survey, so there is no links that could potentially expose their identity and can 
be used against them and therefore is openly available to the public to be used and shared. 


APPENDIX 


Figure 2. Main attribute with respect to the rank of happiness (continue) 
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Figure 2. Main attribute with respect to the rank of happiness (continue) 
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Figure 2. Main attribute with respect to the rank of happiness (continue) 
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Figure 3. Main attribute with respect to the rank of happiness 
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Figure 4. World map visualization to show where the happiest and least happy countries are plus how other 
factors work out (continue) 
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Figure 4. World map visualization to show where the happiest and least happy countries are plus how other 
factors work out 
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Figure 5. World map visualization to show where the happiest and least happy countries are plus how other 
factors work out (continue) 
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